Precision and Mathematical Form in First and Subsequent Mentions of Numerical Facts and their Relation to Document Structure
Abstract
In a corpus study we found that authors vary both mathematical form and precision when expressing numerical quantities. Indeed, within the same document, a quantity is often described vaguely in some places and more accurately in others. Vague descriptions tend to occur early in a document and to be expressed in simpler mathematical forms (e.g., fractions or ratios), whereas more accurate descriptions of the same proportions tend to occur later, often expressed in more complex forms (e.g., decimal percentages). Our results can be used in Natural Language Generation (1) to generate repeat descriptions within the same document, and (2) to generate descriptions of numerical quantities for different audiences according to mathematical ability.
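As a rough illustration of the first use case, the sketch below renders the same proportion as a simple, hedged ratio on first mention and as a decimal percentage on later mentions. It is not the method of the paper: the function names, the cap on the denominator, the hedge word "about", and the number of decimal places are all assumptions chosen for illustration, and the input is assumed to lie strictly between 0 and 1.

```python
from fractions import Fraction


def first_mention(proportion: float, max_denominator: int = 10) -> str:
    """Vague, simple form: the nearest ratio with a small denominator.

    The denominator cap and the hedge word "about" are illustrative
    assumptions, not values taken from the corpus study.
    """
    exact = Fraction(proportion)
    approx = exact.limit_denominator(max_denominator)
    hedge = "" if approx == exact else "about "
    return f"{hedge}{approx.numerator} in {approx.denominator}"


def later_mention(proportion: float, decimals: int = 1) -> str:
    """More precise, more complex form: a decimal percentage."""
    return f"{proportion * 100:.{decimals}f}%"


def describe(proportion: float, mention_index: int) -> str:
    """Pick the form by position: simple first, precise afterwards."""
    if mention_index == 0:
        return first_mention(proportion)
    return later_mention(proportion)


if __name__ == "__main__":
    for i in range(2):
        print(describe(0.2531, i))
```

For the second use case, the same two functions could be selected by a reader profile rather than by mention position, again as an assumption about how such a generator might be organised.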
Publication date: 2009